seo

Crawled — Currently Not Indexed: A Coverage Status Guide

Ali JalilPour June 27, 2024

0 0 3 minutes read

If you’re consistently seeing your URLs getting filtered out of the index, you’ll need to take steps to make your content more unique.

While there is no one-size-fits-all standard for achieving this, here are some options:

Rewrite the content to be more unique on high-priority pages.
Use dynamic properties to automatically inject unique content onto the page.
Remove large amounts of unnecessary boilerplate content. Pages with more templated text than unique text might be getting read as duplicate.
If your site is dependent on user-generated content, inform contributors that all provided content should be unique. This may help prevent instances where contributors use the same content across multiple pages or domains.

Table of Contents

8. Private-facing content

Priority: High

There are some instances where Google’s crawlers gain access to content that they shouldn’t have access to. If Google is finding dev environments, it could include those URLs in this report. We’ve even seen examples of Google crawling a particular client’s subdomain that is set up for JIRA tickets. This caused an explosive crawl of the site, which focused on URLs that shouldn’t ever be considered for indexation.

The issue here is that Google’s crawl of the site isn’t focused, and it’s spending time crawling (and potentially indexing) URLs that aren’t meant for searchers. This can have massive ramifications for a site’s crawl budget.

Solution: Adjust your crawling and indexing initiatives.

This solution is going to be entirely dependent on the situation and what Google is able to access. Typically, the first thing you want to do is determine how Google is able to discover these private-facing URLs, especially if it’s via your internal linking structure.

Start a crawl from the home page of your primary subdomain and see if any undesirable subdomains are able to be accessed by Screaming Frog through a standard crawl. If so, it’s safe to say that Googlebot might be finding those exact same pathways. You’ll want to remove any internal links to this content to cut Google’s access.

The next step is to check the indexation status of the URLs that should be excluded. Is Google sufficiently keeping all of them out of the index, or were some caught in the index? If Google isn’t indexing a large amount of this content, you might consider adjusting your robots.txt file to block crawling immediately. If not, “noindex” tags, canonicals, and password protected pages are all on the table.

Case study: duplicate user-generated content

For a real-world example, this is an instance where we diagnosed the issue on a client site. This client is similar to an e-commerce site as a lot of their content is made up of product description pages. However, these product description pages are all user-generated content.

Essentially, third parties are allowed to create listings on this site. However, the third parties were often adding very short descriptions to their pages, resulting in thin content. The issue occurring frequently was that these user-generated product description pages were getting caught in the “Crawled — currently not indexed” report. This resulted in missed SEO opportunity as pages that were capable of generating organic traffic were completely excluded from the index.

When going through the process above, we found that the client’s product description pages were quite thin in terms of unique content. The pages that were getting excluded only appeared to have a paragraph or less of unique text. In addition, the bulk of on-page content was templated text that existed across all of these page types. Since there was very little unique content on the page, the templated content might have caused Google to view these pages as duplicates. The result was that Google excluded these pages from the index, citing the “Crawled — currently not indexed” status.

To solve for these issues, we worked with the client to determine which of the templated content didn’t need to exist on each product description page. We were able to remove the unnecessary templated content from thousands of URLs. This resulted in a significant decrease in “Crawled — currently not indexed” pages as Google began to see each page as more unique.

Ali JalilPour June 27, 2024

0 0 3 minutes read

How to Approach Owned and Earned Media

It’s Your Turn: Now Accepting Community Speaker Pitches for MozCon 2015

5 Things I Learned About E-A-T by Analyzing 647 Search Results

How Google’s Rankings Algorithm Has Changed Over Time

Link Building Using Unique Original Content and Other Techniques

Is Google Using ITA Data with New Flights Onebox?

How to Build a Facebook Funnel That Converts

Untapped Search Verticals

12 Easy Mistakes That Plague Newcomers to the SEO Field

Exactly How Powerful Are Tweets & Retweets? Help Us Find Out!

How to Use Domain Authority 2.0 for SEO

Screen Size Matters: Adapting Content Strategy for Multiple Devices

Crawled — Currently Not Indexed: A Coverage Status Guide

8. Private-facing content

Priority: High

Solution: Adjust your crawling and indexing initiatives.

Case study: duplicate user-generated content

Ali JalilPour

Leave a Reply Cancel reply

Web hosting for SEO: Why it’s important

SEM career playbook: Overview of a growing industry

What Is SEO – Search Engine Optimization?

My Top Three Mozinar Takeaways

A Sneak Preview of the London Pro SEO Seminar 2010

Announcing the New Moz SEO Essentials Certification: What It Is & How to Get Certified

How I Develop Successful Link Building Strategies for My Clients

Optimizing for AI Overviews

My Top 5 Local SEO and Marketing Takeaways From MozCon 2024

How I Develop Successful Link Building Strategies for My Clients

Top SEO Tips for 2024 — Whiteboard Friday

Intro to Python [Part 2]

8. Private-facing content

Priority: High

Solution: Adjust your crawling and indexing initiatives.

Case study: duplicate user-generated content

Subscribe to our mailing list to get the new updates!

We Need to Talk About Google's “People Also Ask”: A Finance Case Study

Defense Against the Dark Arts: Why Negative SEO Matters, Even if Rankings Are Unaffected

Related Articles

Leave a Reply Cancel reply

Web hosting for SEO: Why it’s important

SEM career playbook: Overview of a growing industry

What Is SEO – Search Engine Optimization?

My Top Three Mozinar Takeaways

A Sneak Preview of the London Pro SEO Seminar 2010

Announcing the New Moz SEO Essentials Certification: What It Is & How to Get Certified

How I Develop Successful Link Building Strategies for My Clients

Optimizing for AI Overviews

My Top 5 Local SEO and Marketing Takeaways From MozCon 2024

How I Develop Successful Link Building Strategies for My Clients

Top SEO Tips for 2024 — Whiteboard Friday

Intro to Python [Part 2]